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METHODS AND COMPOSITIONS FOR MODULATING "MARGINALLY 
INDISCRIMINANT" HYBRIDIZATIONS 

RELATED APPLICATIONS 
This application claims the benefit of priority under 35 U.S.C. 119(e) to 
copending U.S. Provisional Application No. 60/095,313, filed on August 4, 1998, the 
entire contents of which are incorporated herein by reference. 

BACKGROUND OF THE INVENTION 
The ability to control hybridization of a nucleic acid strand (a probe) to its 
complement, while excluding imperfectly base-paired probe hybridization has been 
central to the advancement of both molecular biological techniques and to design of 
nucleic acid diagnostic systems. Much attention has been paid to this issue because 
identification of a particular mutant nucleic acid sequence fully complementary to a 
probe can permit detection of, for example, the existence of a mutant sequence 
(genetic disorders) or a particular virulent bacterial or viral strain in a patient. Thus, 
studies of the physics of mismatched probe:target energetics has focused on the 
difference in free energy of such mismatches with the hope that such knowledge will 
benefit the development of assays in which such mismatches are excluded. For 
example, in diagnosis of a genetic disorder a mutant probe for targeting a nucleic acid 
molecule having a sequence containing a single base mismatch associated with such a 
disorder would produce a false positive result if the probe also hybridizes to the wild- 
type (unmutated) sequence. In other words, the assay must be sufficiently 
discriminatory in order for the probe to bind to the molecule having the mutated base 
and not to the molecule lacking the mutated base. If the probe hybridizes to both 
molecules, then the hybridization result would indicate the presence of the single base 
mutated sequence even though it was not, in fact, contained in the tested sample. In 
general, the environment of the reaction is manipulated to eliminate such mismatch 
probe:target interactions by modifying the physical conditions for hybridization (e.g., 
temperature and or time) or composition of the hybridization buffer (e.g., salt, 
divalent ions denaturing agents, etc.). 
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On the other hand, however, it is advantageous, in some applications, to have a 
probe which is known to hybridize with molecules containing particular mismatched 
sequences (a "marginally indiscriminant" probe) within a desired degree of homology 
to the probes' perfect complement. This would permit a single probe to be used in an 

5 assay for determining the presence of nucleic acid molecules containing any of the 
mismatched sequences. Such an assay would thus reduce, or possibly even eliminate, 
the need for more than one probe, each containing a nucleic sequence precisely 
corresponding to a sequence of a target molecule. Achievement of such a probe could 
be useful as a "multiplex" (multiple assays from one probe) probe. To date, for 

10 example, conventional multiplexing has relied upon the inclusion of multiplex specific 
probes into one cocktail reaction (e.g., multiplex polymerase chain reaction (PCR)), 
rather than just one probe. 

There are always going to be constraints on an indiscriminant probe. It would 
be generally acceptable for a probe to hybridize to any nucleic acid molecule whether 

15 complementary or not (although there may be limited use for such a probe in 

detecting the presence or absence of any DNA). This type of probe and/ or conditions 
for hybridization of the probe would detect even sequences which shared no 
homology with the probes' complement. At the other end of the spectrum, it is 
desirable that when designing a marginally indiscriminant probe for detecting viral 

20 nucleic acid sequences, for example, to design a probe such that a single probe will 
pick up all known of sequence within a limited degree of homology (say 10, 20, 30, 40 
or 50% homology). 

There are known approaches for detecting target nucleic acids by hybridization 
of a probe having a nucleic acid sequence fully complementary to or substantially 

25 complementary to a sequence of a target nucleic acid. Methods have thus been 

developed to detect viral nucleic acid sequences and their variants by hybridization 
using probes fully complementary to or substantially complementary to the viral 
nucleic acid sequences, as exemplified by US. Patents Nos. 5,008,182; 5,079,351; 
5,268,268; 5,567,603; 5,594,122; 5,594,123; 5,599,662; and 5,733,781, the text of which is 

30 incorporated herein by reference. 
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The specifications of these patents disclose methods and compositions of 
nucleic acids for as probes for detecting nucleic acid sequences of the family of Human 
T-cell Leukemia Viruses (HTLV) and the Human Immunodeficiency Virus (HIV), HIV 
and its variants are thought to be responsible for the acquired immunodeficiency 

5 syndrome (AIDS). The probes and methods disclosed in these patents for detecting 
the presence or absence of the viral DN A utilize probes to conserved regions of these 
viruses, but the disclosed approaches have limited applicability. This is because of the 
now well-known genetic variability of human immunodeficiency viruses. Genetic 
variations arise with high frequency. This variability has complicated the 

10 development of assays for detecting the presence of their genetic material. Further, 
while a comparison of various HIV-1 isolates has revealed, regions of the genome that 
are reasonably well conserved, it is possible that even the conserved regions, regions 
to which the probes have been designed to hybridize, may at mutate in the future. If 
so, probes designed for detecting the conserved regions may not hybridize to the one 

15 conserved region as a result of base mismatches. 

As a further example, U.S. Patent No. 5,567,603 describes probes for detecting 
HIV-3 that hybridize neither with the sequences of HIV-1 nor with the sequences of 
HIV-2 under stringent hybridization conditions. Thus, the ability to design a single 
nucleic acid probe and a method that will allow hybridization of the probe to all HIV 
20 strains and their variants but not to other non-target partially complementary nucleic 
acid sequences or other non-related viral nucleic acid sequences would have 
advantages over current approaches. 

SUMMARY OF THE INVENTION 
The present invention describes how a nucleic acid probe with mismatches to a 
25 target may be forced to hybridize to a target without hybridizing indiscriminantly 
with other non-target partially complementary nucleic acids. The methods of the 
invention require that nucleic acid duplex ligands as well as nucleic acid single-strand 
ligands be titered in concentration against one another to achieve the required degree 
of mismatch target hybridization without obtaining non-target hybridization. 
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In one aspect of the invention, a method of providing a nucleic acid molecule 
for potential use as a probe for a family of nucleic acid molecules in the present of a 
nucleic acid sequence binding ligand which will promote hybridization of the probe to 
all the member of the family of target sequences and not to non-target partially 
5 complementary sequences is provided. The method includes the steps of providing 
the family of first nucleic acid molecules wherein each member of the family is related 
to all other members of the family by a consensus sequence. A second nucleic acid 
molecule complementary to the consensus sequence is synthesized by methods well 
known in the art. It is highly preferable that the homology of this complementary 
10 sequence to other viral nucleic acid sequences as well as other sequences in general be 
determined by comparing its nucleotide sequence against those listed in a database 
(e.g., GenBank, DDBJ, EBI, or GSDB) to ensure that it does not by chance happen to 
have significant homology to other non-target partially complementary sequences. In 
addition, the homology of the complementary sequence should also be searched 
1 5 against all members of the family of target sequences to determine if the probe might 
hybridize to a region(s) other than it was originally intended to. Once it is determined 
that the complementary sequence will most likely not hybridize to a region(s) of the 
target sequence and all its family members other than it was intended to or to other 
non-target partially complementary nucleic acid sequences, the nucleic acid sequence 
20 complementary to the consensus sequence can be used as a probe. The ability of the 
probe to hybridize to the consensus region of the target nucleic acid sequence and all 
members of the family is then determined in the presence of a certain concentration of 
nucleic acid sequence binding ligand known to affect hybridization of the probe to the 
complementary region of the target sequence. This is repeated at several different 
25 concentrations of ligand, such that the concentration of ligand at which the probe is 
able to bind to the target nucleic acid sequence and all its family members equally well 
without affecting the hybridization of the probe to other non-target partially 
complementary nucleic acid sequences is the concentration of ligand that will be used 
for subsequent methods of the invention for that particular probe in detecting the 
30 presence of the its target nucleic acid sequence and its genetic variants. 
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In yet another aspect, hybridization of the probe to the target nucleic acid 
sequence and its family members can be further improved in the presence of two 
different nucleic acid binding ligands. 

In a preferred embodiment, a method of promoting the hybridization of a 

5 nucleic acid capture moiety comprising a nucleic acid sequence complementary to a 
consensus sequence of a target single-stranded nucleic acid sequence and all its family 
members without hybridizing to a plurality of other non-target partially 
complementary nucleic acid sequence suspected of being present in a sample is 
provided. The method includes the steps of identifying at least one consensus 

10 sequence to a region of the target duplex nucleic acid sequences and all its genetic 
variants suspected of being present in a sample; synthesizing a nucleic acid sequence 
complementary to the consensus sequence (probe); providing a nucleic acid capture 
moiety comprising the probe; a nucleic acid binding ligand, wherein the ligand has 
been selected to promote hybridization of the probe to the corresponding 

15 complementary region of the target nucleic acid and all its family members; the 
sample suspected of containing the target nucleic acid sequence and all its family 
members; and allowing the target single-stranded nucleic acid sequence and all its 
family members to the nucleic acid capture moiety comprising the probe without 
promoting the hybridization of other non-target partially complementary nucleic acid 

20 sequences. 

In yet another preferred embodiment, a consensus sequences of the target 
sequence and all its family members can be amplified by PCR if it is suspected that 
direct detection of the target nucleic acid sequences and all its family members may be 
different or impossible. In this case, the nucleic acid primers used for detecting the 

25 target nucleic acid sequences and all its family members should be designed such that 
the primers will allow amplification of the region of target nucleic acid sequence and 
all its family members (i.e., the consensus sequence) to be detected without 
simultaneous amplification of non-target partially complementary nucleic acid 
sequences from other viral nucleic acid sequences or human genomic nucleic acid 

30 sequences. The hybridization of the amplified region of the target sequence and all its 
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family members to a nucleic acid capture moiety comprising the probe can then be 
performed under conditions in which the presence of a nucleic acid sequence binding 
ligand will promote hybridization of the target single-stranded nucleic acid sequence 
and all its family members to the nucleic acid capture moiety without promoting the 
hybridization of other non-target partially complementary nucleic acid sequences. 

In a preferred embodiment, the invention described herein can be used to 
detect nucleic acid sequences of the AIDS virus and all of its family members without 
detecting other non-target viral nucleic acid sequences or human genomic DN A. 

In yet another preferred embodiment, the invention described herein can be 
used to detect nucleic acid sequences associated with infectious diseases, genetic 
disorders, or cellular conditions such as cancer in which the gene responsible for the 
pathological condition is known to be caused by several mismatch variant nucleic acid 
sequences. Examples of such genes include but are not limited to p53, ras, BRCA1, 
BRCA2, or APC. 

In another embodiment, the invention herein relates to a multi-container kit for 
detecting target nucleic acid sequences and their mismatch nucleic acid sequences 
suspected of being present in a sample, which kit comprises: 

(a) a nucleic acid capture moiety comprising a labeled probe nucleic acid 
sequence substantially complementary to a consensus sequence of the target 
duplex nucleic acid sequence and all its family members variants suspected 
of being present in a sample; and 

(b) at least one nucleic acid sequence binding ligand, wherein the ligand 
promotes hybridization of the target single-stranded nucleic acid sequence 
and all its family members to the nucleic acid capture moiety comprising the 
labeled probe sequence without promoting the hybridization of other non- 
target partially complementary nucleic acid sequences. 

Preferably, the kit also contains nucleic acid sequences primers for amplifying 
the region of the target nucleic acid sequence and all its family members to be 
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detected, an agent for polymerization and four different nucleosides. It is also 
preferable that the kit contain the relevant positive and/ or negative controls. 

In preferred embodiments, the label of the labeled probe nucleic acid sequence 
is then selected from the group consisting of antibody, antigens, radioisotopes, 
fluorescent, enzyme, lecithin or biotin. 

In a preferred embodiment, the components of the kit are designed to detect the 
AIDS virus and all its family members without detecting non-target viral nucleic acid 
sequences and other non-target nucleic acid sequences. 

In another preferred embodiment, the components of the kit are designed to 
detect nucleic acid sequences associated with infectious diseases, genetic disorders, or 
cellular conditions such as cancer in which the gene responsible for the pathological 
condition is known to be caused by several mismatch variant nucleic acid sequences. 
Examples of such genes include but are not limited to p53, ras, BRCA1, BRCA2, or 
APC. 

BRIEF DESCRIPTION OF THE FIGURES 
Figure 1 shows the effect of titrating of four DNA binding ligands in a DN A 
hybridization reaction in which the target molecules were labeled with the 
radioisotope 32 P. Dark bands indicate unbound target (i.e., higher band intensity=less 
bound target). The control lane shows the total intensity of unbound target. 
Hybridization was allowed to occur under the conditions described herein. At the 
end of the specified time period for hybridization, the amount of unbound target was 
determined by gel electrophoresis. 

Figure 2 shows the effect of titrating combinations of four DNA binding ligands 
in a DNA hybridization reaction in which the target molecules were labeled with 32 P. 
Distamycin A was held constant at lrnM for those sets with drug combinations. 
Hybridization was allowed to occur under the conditions described herein. At the 
end of the specified time period for hybridization, the amount of unbound target was 
determined by gel electrophoresis. 
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Figure 3a shows the time dependence of target:probe hybridization in the 
presence and absence of distamycin A and ethidium bromide. The arrow indicates the 
perfect matched probe. The hairpin and target sequences are listed in the inset. At the 
end of the specified time period amount of unbound target was determined by gel 
5 electrophoresis. 

Figure 3b shows graphs of normalized binding curves from the data obtained 
in Figure 3a. Gray circles indicate hybridization without DNA ligands. Black circles 
indicate hybridization in ImM distamycin and O.OOlmM ethidium bromide. 

Figure 4 shows the effect of salt concentration dependence of denaturation of 
10 target:probe hybrids of 40% (v/v) formamide. The buffer was lOmM phosphate, 
pH 7.2 and the specified concentration of NaCl. Wash incubation time was held 
constant at 1 hr. The amount of unbound target at the end of the wash incubation 
time was determined by gel electrophoresis. 

Figure 5a shows the effect of formamide concentration when cross-tittered with 
15 distamycin A in the wash buff er on denaturation of target: probe hybrids. Darker 
bands indicate a higher degree of dissociation. At the end of the wash period, the 
amount of 32 P labeled target was determined by gel electrophoresis. 

Figure 5b shows graphs of binding curves from the data obtained in Figure 5a 
with each of the constructs shown. 

20 Figure 5c shows graphs of the fraction of target:probe hybrids remaining after 

denaturation as a function of formamide at different concentrations of distamycin A 
with each of the constructs shown. 

DETAILED DESCRIPTION OF THE INVENTION 
The methods and unique compositions of the invention are useful in the 
25 detection of nucleic acid sequences and all related family members without detecting 
non-target partially complementary nucleic acid sequences. The methods of the 
invention can be performed with nucleic acid capture moieties immobilized on a solid 
support such as multi-well plates, membranes or gene chips. The methods of the 
invention can also be automated, in part, to speed screening and improve economy. 
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The term "single-stranded nucleic acid", as used herein, refers to a duplex 
nucleic acid which has been denatured resulting in two single-stranded nucleic acid 
sequences of DNA or RN A. Methods of denaturing duplex nucleic acid sequences are 
well known to those skilled in art. Single-stranded nucleic acid can also mean a mixed 
5 DN A-RN A strand, or nucleic acid-like compounds such as peptide nucleic acids. A 
nucleic acid strand can also include modified (e.g., chemically or biochemically 
modified) DNA or RNA bases, of which many are known in the art. 

The terms "target nucleic acid sequence", "target nucleic acid" or "target strand" 
refer to a nucleic acid sequence which is to be detected, sequenced, immobilized, or 

10 manipulated. The target nucleic acid sequence can be any nucleic acid strand, as 

defined above, and in general will be single-stranded or will be made single-stranded 
by methods known to those skilled in the art. The target nucleic acid sequence can be 
obtained from various sources including plasmids, viruses, bacteria, fungi, yeast, 
plants, and animals, including humans- or the target nucleic acid sequence can be 

15 obtained from non-natural sources. The target nucleic acid sequence can be obtained 
from various organisms or tissues, including fluids such as blood, semen, urine and 
the like. The target nucleic acid sequence is preferably extracted or purified to remove 
or reduce contaminating or interfering materials such as proteins or cellular debris. 
Procedures for such purification or extraction of target nucleic acids sequences are 

20 known in the art, including, for example, those described in Maniatis et al v "Molecular 
Cloning: A Laboratory Manual* , Cold Spring, Harbor Laboratory (1989), or in Bell et 
al., Proa Nat Acad ScL USA (1991), 78:5759-576. The methods and compositions of 
the inversion are particularly useful in the detection of nucleic acid sequences 
associated with infectious diseases, genetic disorders, or cellular conditions such as 

25 cancer. 

In one aspect, the invention features a nucleic acid capture moiety which has at 
least one nucleic acid sequence complementary to at least one consensus sequence of a 
target nucleic acid sequence and having at least two nucleic acid sequence regions 
which are capable of forming an intramolecular duplex. The capture moiety can be 
30 immobilized on the solid support before, simultaneous with, or after capturing the 
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single-stranded target nucleic acid sequence. A nucleic acid capture moiety can 
"capture" a target nucleic acid sequence by hybridizing to the target nucleic acid 
sequence and thereby immobilizing the target nucleic acid sequence and all its family 
members. In preferred embodiments, the nucleic acid capture moiety comprises a 
nucleic acid sequence strand which has at least one nucleic acid sequence which is 
complementary to a consensus sequence of the target nucleic acid and all its family 
member. One example of a nucleic acid capture moiety is a nucleic acid hairpin. A 
"hairpin" is a double-helical region in a single DNA or RN A strand formed by the 
hydrogen bonding between adjacent inverse complementary sequences along the 
nucleic acid strand. The use of a nucleic acid hairpin as a nucleic acid capture moiety 
has been described in detail in U.S. Patent No. 5,770,365 issued to the current 
applicants, the disclosure of which is incorporated herein by reference. In certain 
embodiments, the nucleic acid sequence capture moiety, whether a single-stranded 
nucleic acid sequence or a imcleic acid hairpin, may be labeled as with, e.g., a 
radioisotope, a fluorescent moiety, an antibody, an antigen, a lecithin, an enzyme, 
biotin or other labels well known in the art. Alternatively, the target sequence may 
also be labeled or labeled secondary probes may be employed. A "secondary probe" 
is a nucleic acid sequence which is fully complementary or substantially 
complementary to a region of the target nucleic acid sequence or to a region of the 
nucleic acid capture moiety. "Substantially complementary" as used herein means that 
the sequence must be sufficiently complementary to the nucleic acid being detected 
such that hybridization will take place under the conditions employed. Alternatively, 
a nucleic acid capture moiety can also be a linear nucleic acid sequence such as a single 
stranded DNA or RN A nucleic acid comprising at least one nucleic acid sequence 
which sequence is complementary to the consensus sequence to the consensus 
sequence to all members of the family of target sequences. 

As used herein, the term "consensus sequence" means an idealized sequence 
that represents the nucleotides most often present at each position in a given segment 
of all members of the family of target sequences. One method of determining a 
consensus sequence is to use a computer program to compare the target nucleic acid 
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sequence and all its family member sequences for which a consensus sequence is 

desired. For this purpose, a commercial program with the underlying computer 

algorithm provided by the National Biomedical Research Foundation using a dot 

matrix may be conveniently employed. The program involves inputting the nucleic 

acid sequences of the target nucleic acid sequence and all its generic variants and 

defining a window size for base pair homology. The program employs graphics to 

compare the sequences on different axes, and a dot appears where there is at least 

substantial homology. As used herein, the term "target nucleic acid sequence and all 

its genetic variants" refers to the wild-type nucleic acid sequence and all base 

mismatch variants of the wild-type sequence. Once the consensus sequence has been 

determined a nucleic acid sequence complementary to the consensus sequence is 

synthesized by methods well known in the art. It is preferable, however, that prior to 

synthesizing the complementary sequences, the complementary sequence be searched 

against a plurality of nucleic acid sequences listed in one or more of the nucleic acid 

sequence databases which include but is not limited to the DN A Data Bank of Japan 

(DDBJ), the European Bioinf ormatics Institute (EBI), GenBank, or the Genome 

Sequence Database (GSDB) to determine if the complementary sequence has 

significant homology to other non-target partially complementary nucleic acid 

sequences. If the consensus sequence happens by chance to have significant homology 

to another non-related viral nucleic acid sequence or to an unrelated human sequence, 

a new consensus sequence to another region of all members of the family of target 

sequences can be selected and the above process repeated. By doing this, false 

positives can be eliminated. It is also preferable that the complementary sequence be 

searched against all members of the family of target sequences to determine if the 

complementary sequence might hybridize to a region(s) other than it was originally 

intended to. Once it is determined that the complementary sequence will most likely 

not hybridize to a to a region(s) of the target sequence and all its family members 

other than it was intended to or to other non-target partially complementary nucleic 

acid sequences, the nucleic acid sequence complementary to the consensus sequence 

can be synthesized and used as a probe. Otherwise, a new consensus sequence should 

be selected and the above process repeated. 
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"Sequence identity or homology", as used herein, refers to the sequences 
similarity between two nucleic acid molecules. When a position in both of the two 
compared sequences is occupied by the same base, e.g., if a position in each of two 
DN A molecules is compared by adenine, then the molecules are homologous or 
sequence identical at that position. The percent of homology or sequence identity 
between two sequences is a function of the number of matching or homologous 
identical positions shared by the two sequences divided by the number of positions 
compared x 100. For example, if 6 of 10, of the positions in two sequences are the 
same, then the two sequences are 60% homologous or have 60% sequence identity. By 
way of example, the DNA sequences ATTGCC and TATGGC share 50% homology or 
sequence identity. Generally, a comparison is made when two sequences are aligned 
to give maximum homology. Unless otherwise specified "loop out regions", e.g., those 
arising from deletions or insertions in one of the sequences are counted as mismatches. 

The comparison of sequences and determination of percent homology between 
two sequences can be accomplished using a mathematical algorithm. Preferably, the 
alignment can be performed using the Clustal Method. Multiple alignment 
parameters include GAP Penalty =10, Gap Length Penalty =10. For DNA alignments, 
the pairwise alignment parameters can be Htupla =2, Gap penalty =5, Window =4 and 
Diagonal saved =4. For protein alignments, the pairwise alignment parameters can be 
Ktuple =1, Gap penalty =3, Window =5, and Diagnosis Saved =5. 

Additional non-limiting example of a mathematical algorithm utilized for the 
comparison of sequences is the algorithm of Karlin and Altachu (1990) Proc. Nam. 
Acad. Sci. USA 87:2264-68, modified as in Karlin and Altachu (1993) Proc. Natn. Acad. 
Sci. USA 90:5873-77. Such an algorithm is incorporated into the NBLAST and XBLAST 
programs (version 2.0) of Altachu, et al., (1990) J. Mol Biol. 215:403-10. BLAST 
nucleotide searches can be performed with the NBLAST program, score =100 
wordlength =12 to obtain nucleotide sequences homologous to nucleic acid molecules 
of the invention. BLAST protein searches can be performed with the XBLAST 
program, score =50, wordlength =3 to obtain amino acid sequences homologous to 
protein molecules of the invention. To obtain gapped alignments for comparison 
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purposes, Gapped BLAST can be utilized as described in Altachu et al., (1977) Nucleic 
Acids Research 25(17):3389-3402. When utilizing BLAST and Gapped GLAST 
programs, the default parameters of the respective programs (e.g., XBLAST and 
NBLAST) can be sued. See http://www.nebi.nlm.nih.gov. Another preferred non- 
limiting example of a mathematical algorithm utilized for the comparison of 
sequences is the algorithm of Myers and Miller, CABIOS (1989). Such an algorithm is 
incorporated into the ALIGN program (version 2.0) which is part of the GCG sequence 
alignment software package. When utilizing ALIGN program for comparing amino 
acid sequences, a PAM120 weight residue table, a gap length penalty of 12, and a gap 
penalty of 4 can be used. 

The compositions and methods of the invention generally feature the use of at 
least one base-preferring binding ligand (or, in some cases, sequence-specific ligand) 
to promote hybridization of a probe nucleic acid sequence to promote hybridization of 
the target single-stranded nucleic acid sequence and all its family members to the 
nucleic acid capture moiety without promoting the hybridization of other non-target 
partially complementary nucleic acid sequences. The methods and compositions of 
the invention can also include one or more additional binding ligands, which can be 
base-preferring or sequence-specific ligands, or non-specific ligands, and can bind 
duplex nucleic acid sequences or single-stranded nucleic acid sequences. The term 
"nonspecific binding ligand", as used herein, refers to a nucleic acid binding ligand 
that does not substantially preferentially bind to nucleic acid sequences in which one 
or more specified bases predominate. That is, a "nonspecific binding ligand" binds to 
all, or a large variety of, bases or sequences approximately equally well. The choice of 
appropriate ligands will be routine to the skilled artisan in light of the teachings 
herein, as explained in more detail below. 

Ligands suitable for use in the present invention are capable, in general, of 
binding to nucleic acid single strands and/ or duplexes. In general, it is necessary to 
provide at least one base-preferring ligand in the reaction mixtures of the invention. 

A variety of base-preferring ligands have been described. For example, the 
duplex-binding ligand distamycin A has been reported to bind preferentially to AT- 
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rich sequences. Other base-preferring, duplex-binding ligands include certain 
restriction enzymes, drugs such as actinomycin D (which has a primary binding site of 
S'-GC-S', and a secondary preference for GT sites) and intercalators such as ethidium 
bromide (as described below). 

Similarly, base-preferring single strand-binding ligands can be employed in the 
invention. 

The method of the invention is particularly useful for detecting genetic variants 
of a target nucleic acid sequence by hybridization using a single probe in the presence 
of a pre-selected nucleic acid binding ligand under conditions such that the nucleic 
acid binding ligand will promote hybridization of the target nucleic acid and all its 
genetic variants with the probe but not to other non-target partially complementary 
nucleic acid sequences. Specifically, the method of the invention can be used to detect 
the presence of the AIDS virus nucleic acid sequence and all its genetic variants. A 
candidate consensus sequence to a particular region of the viral nucleic acid sequence 
and all its family members is first selected. A second nucleic acid sequence 
complementary to the consensus sequence, the probe, can then be synthesized by 
methods well known in the art. It is preferable that the nucleic acid sequence of the 
probe be compared to a plurality of nucleic acid sequences in a database to rule out 
the possibility that other non-related viral nucleic acid sequences or other nucleic acid 
sequences with significant homology may hybridize to the probe, resulting in false 
positives. The conditions under which a nucleic acid ligand will promote 
hybridization of the probe sequence to the AIDS virus nucleic acid sequence and all its 
family members is then determined. A second nucleic acid ligand different from the 
first can also be used to further improve hybridization of the probe to the AIDS virus 
nucleic acid sequence and all its genetic variants without promoting hybridization of 
the probe to other non-target partially complementary sequences. If it is suspected 
that the amount of AIDS viral nucleic acid sequence present is below the level of direct 
detection of the method herein, the consensus sequence van be amplified by PCR 
using consensus sequence primers immediately adjacent to the region to be detected. 
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Similarly, the method of the invention can be used to detect the presence of any 
nucleic acid sequences associated with infectious diseases, genetic disorders, or 
cellular conditions such as cancer in which the gene responsible for the pathological 
condition is known to be caused by several mismatch variant nucleic acid sequences. 
Examples of such genes include but are not limited to p53, ras, breast cancer antigen 1 
(BRCA1), or breast cancer antigen (BRCA2). 

The present invention will now be illustrated, but is not intended to be limited 
by the following examples: 

General Methods 

A. Constructs Used 

The biotinylated DN A capture hairpin (hairpin) was purchased from a 
commercial supplier (Oligos Therapeutics), wit the following structure: 

TTCCTGGTGCAGCTGATC-5' 
/ 

U* 

TTGGACCACGTCGACTAGGGCTCCTCTGCGATCCATA-3' 
The duplex region will henceforth be referred to as the "stem." The 5 bases 
forming a single-stranded loop on one end of the hairpin will be referred to as the 
"hairpin loop" or "loop." "U*" refers to biotinylated U, used for attaching the hairpin to 
a solid support (in this case, streptavidin-coated microtiter plates). The single- 
stranded region (shown above in bold-face) will be referred to as the "dangling end." 

Single-stranded DN A molecules fully or partially complementary to the 
dangling end (referred to as "probe") were also purchased from the same supplier. 
These molecules were of different lengths to allow them to be separated and 
visualized by PAGE. The sequences are: 

15-mer perfect match: 5'-TAT GGA TCG GCA GAG-3' 

17-mer mismatch: 5'-AT TAT GGA TCG GCA GAT-3' 

19-mer mismatch: 5-AAAT TAT GGA TCG GCG GAG-3' 
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21-mer mismatch: 5'-TAAAAT TAT GGA TCT GCA GAG-3 1 

23-mer mismatch: 5'-TTTAAAAT TAT GGG TCG GCA GAG-3' 

Note that the mismatches are longer than the 15-mer perfect-matched sequence 
on the 5'-end. 

The duplex molecules formed are: 

1. TTCTGGTGCAGCTGATC-5' GAGACGGCTAGGTAT-5' 
/ 

tr 
\ 

TTGGACCACGTCGACTAGGGCTCCTCTGCGATCCATA-3' 

2. TTCTGGTGCAGCTGATC-5' TAGACGGCTAGGTATTA-5' 
/ 

IT 
\ 

TTGGACCACGTCGACTAGGGCTCCTCTGCGATCCATA-3 1 

3. TTCTGGTGCAGCTGATC-5' GAGGCGGCTAGGTATTAAA-5 1 
/ 

U* 
\ 

TTGGACCACGTCGACTAGGGCTCCTCTGCGATCCATA-3 1 
B. Reaction Conditions 

1. 32 P labeling of the target molecules 

The five target molecules were labeled with 32 P following a standard kinasing 
protocol. The labeled bands were isolated from the reaction solutions by denaturing 
PAGE (8 M. urea, 20% actylamide). 32 P activity was determined by scintillation 
counting. 

2. Capture hairpin immobilization on microtiter plates 

A solution of the capture hairpin at 10 pmol/SODl in PBS (150 mM NaCl, 10 
mM phosphate, pH 7.2) was prepared. 50Dl/well was loaded on streptavidin-coated 
microtiter plates (Boehringer-Mannheim #1645692) and allowed to incubate for 30 min 
at room temperature. After the incubation period, the wells were washed 6 times with 
PBS, and blotted on clean Kimwipes. 
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3. General procedure for hybridization 

A cocktail of the labeled targets was prepared by adding a sufficient amount of 
each target to the hybridization mixture to give a final concentration of -20,000 
cpm/target/ 25D1. The final composition of hybridization mixture is 1M NaCl, lOmM 
5 phosphate, pH 7.2, and the specified concentration of the ligand. 25 Dl of the target 
cocktail was loaded into each well and the plate was incubated for the specified 
amount of time. After incubation, each reaction mixture as quantitatively transferred 
to a 0.2 ml tube (Costar 6547). 

The samples were analyzed by denaturing PAGE as follows; 10 Dl of loading 
10 dye (8 M urea, 5 mM Tris-HCl, pH 7.5, 100 mM EDTA, 0.01% bromophenol blue, 
0.01% xylene cynol) was added to the tube, and the whole sample was loaded onto a 
15% acrylamide/lx TBE/7 M urea gel PAGE was run at 20 mA/gel for 2 hours. 
After electrophoresis, the gels were visualized by autoradiography. 

4. General procedure for denaturation 

15 The hybridization mixture was incubated typically for 2 hours, under the 

specified conditions (i.e., hybridization buffer + ligands). After incubation, the 
reaction mixture was removed, and the wells washed once with 100 pi 1M 
NaCl phosphate, pH 7.2. The plate was blotted on Kimwipes, and 50 jal of the 
specified denaturation buffer was added and allowed to incubate for the specified 

20 amount of time. The mixture was then quantitatively transferred to o.2 pi of loading 
dye was added, and the sample analyzed by PAGE as above. 

The autoradiograms were done by exposing X-ray films (Kodak X-OMAT) to 
the gels overnight, using an image intensifying screen. In some cases, there is a lane 
marked "control." This is a reference lane loaded with an equal volume (25 jal) of 
25 unhybridized target cocktail. Also, in each denaturation set of experiments, there is a 
lane marked "initial." This lane was loaded with the reaction mixture after 
hybridization, which indicates how much of the target has bound. 
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EXAMPLE 1 
Effect Of Single Ligands Of Hybridization 
An hybridization experiment was done where the following binders (see 
Table 1) were titrated: actinomycin D. distamycin A, ethidium bromide, and single- 
strand DNA binding protein (SSB). Incubation time was held constant at 2 hours. The 
results are shown in Figure 1. 

Results 

1. Addition of actinomycin D to the hybridization reaction decreased the extent of 
hybridization in all cases. It acted as a single-strand binder (i.e., denaturant), with 
the activity proportional to the concentration. 

2. Distamycin A improved binding up to a concentration of 0.016 mM, but did not 
improve binding (compared to the control with no ligands) at higher 
concentrations. 

3. Ethidium bromide died not seem to affect the extent hybridization up to a 
concentration of 0.001 mM, and it inhibited hybridization of the longer mismatches 
(19-21, 23-mer sequences) from a concentration of 0.004 mM and higher. There are 
no bands at ImM. However, there was a strong band at the top of the gel (data not 
shown). 

4. SSB did not have an effect on the hybridization up to a concentration of 0.78 
Hg/well. However, a decrease in the extent hybridization was observed at the 
higher SSB concentrations. 

EXAMPLE 2 
Effect of a Ligand Combination of Hybridization 
In this experiment, different combination of ligands were used. The titration of 
distamycin A was repeated (see above), and in three other sets, distamycin A 
concentration was fixed at 1 mM and the other ligands were titrated. The results are 
sown in Figure 2. 
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Results 

1. The distamycin A titration experiment showed nearly identical results with the 
first run. An improvement in the extent of hybridization was observed up to a 
concentration of 0.016 mM, with no improvement at higher concentrations. 

5 2. Titration of actinomycin D in the hybridization mix with a constant amount of 
distamycin A showed markedly different results than when actinomycin D was 
used alone. A comparison of the two experiments (II-l and III-2) showed that 
when actinomycin D was used alone, a decrease in the extent of hybridization was 
apparent even at the lowest concentration used (0.00025mM). When actinomycin 

10 D was used in combination with distamycin A, a decrease in the extent of 

hybridization was noted at 0.004 mM or higher, an approximately 16-fold higher 
concentration. 

3. A combination of distamycin A and ethidium bromide showed a similar effect. 
While there was a decrease in the hybridization at > 0.001 mM ethidium bromide 

15 when it was used alone, there was no decrease in hybridization when it was used 
in combination with distamycin A. Similar to the previous experiment, at ImM 
ethidium bromide, all the unhybridized target was noted at the top of the gel (data 
not shown). 

4. Distamycin A apparently did not have an effect on the activity of SSB. The results 
20 of the distamycin A / SSB combination are similar to the results when SSB alone 

was titrated. 
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EXAMPLE 3 

Effect of a Ligand Combination on Hybridization Time 
The previous experiment showed that a combination of ligands (i.e., 
distamycin A and ethidium bromide) may improve the extent of DN A hybridization. 
A hybridization kinetics experiment was performed where the extent of hybridization 
in the absence of ligand (i.e., hybridization buffer only) and with a combination of 
ligands (1 mM distamycin A + 1 jiM ethidium bromide) were compared as a function 
of time. 

Results 

The results are shown in Figure 3a. A comparison of the band intensities at 40 
and 60 minutes shows an improvement in the hybridization in the presence of ligand. 
This trend is more clear when the intensities are measured (NIH Image) and plotted as 
shown in Figure 3b. 
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EXAMPLE 4 
Effect of Denaturation on Hybridization 
In this set of experiments, we used various combinations of salt concentration, 
distamycin A, and the formamide (a denaturant), to control the extent of duplex to 
single-strand dissociation. The same set of molecules as in the previous section was 
used. 

Hybridization of the target cocktail to the capture hairpin was carried out 
following the procedure described in General Methods. The final composition of the 
hybridization buffer was 1M NaCl, 10 mM phosphate, pH 7.2. The samples were 
incubated for approximately 2 hours at room temperature, and washed once with the 
hybridization buffer. 

The denaturation buffer was 10 mM NaCl, 10 mM phosphate, pH 7.2, and NaCl 
tittered from 0 to 1M. The results are shown in Figure 4. The amount of target 
dissociating from the capture hairpin decreased with an increase in the salt 
concentration with the 15-mer perfect match showing the greatest change. 

Results 

1. Salt Concentration Dependence of Denaturation 

The wash buffer consisted of 40% formamide, 10 mM phosphate pH 7.2, and 
NaCl tittered from 0 to 1M. The results are shown in Figure 4. The amount of target 
dissociating from the capture hairpin decreased with an increase in the salt 
concentration, with the 15-mer perfect match showing the greatest change. 

2. Formamide and Distamycin A Concentration Effect on D issociation 

Figure 5a shows a denaturation experiment where formamide was cross- 
tittered with distamycin A. The buffer concentration was kept constant at 10 mM 
NaCl, 10 mM phosphate pH 7.2 Formamide was tittered from 20-35% at 
2.5 increments, while distamycin A was tittered from 0.062 mM to ImM in 4-fold 
increments. With no distamycin A, the stabilities of the mismatched targets increased, 
as shown by the decrease in their respective band intensities. This effect becomes 
more clear when the bands are quantified and plotted, as shown in Figure 5b (with 
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distamycin A independent variable), and in Figure 5c (with formamide as the 
dependent variable). 

3. Time Dependence of Dissociation as a Function of Distamyc in A Concentration 

An experiment was fun where the extent of dissociation over time was 
measured as a function of distamycin A concentration in the wash buffer. The 
formamide concentration was kept constant at 40% (v/v), and distamycin A was 
tittered in 4-fold increments, at 0,0,62 mM, 0.25 mM, and 1 mM. Time points were 
from 0 (wash buffer was added and pulled out) to 60 min. The results are shown in 
Figure 6. With 0- 0.062 mM distamycin A , denaturation dissociate to the same extent 
as in the previous drug concentration. At 1 mM distamycin A, all target molecules 
show a lower extent of dissociation, with the perfect match showing a marked increase 
in stability. 
EQUIVALENTS 

Those skilled in the art will recognize, or be able to ascertain using no more 
than routine experimentation, numerous equivalents to the specific procedures 
described herein. Such equivalents are considered to be within the scope of this 
invention and are covered by the following claims. The contents of all references, 
issued patents, and published patent applications cited throughout this application are 
hereby incorporated by reference. 
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